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SCORE Search Results Details for Application 

10568337 and Search Result 
20071129_084935_20071129_084935_us-10 

568-337-2-p2n,rni, 



Score Home Retrieve A p plication SCORE System SCORE Comments / 

Page List Overview FAQ Sug g estions 

This page gives you Search Results detail for the Application 10568337 and Search Result 
20071129 J)84935_20071129J)84935_us-10-568-337-2.p2n.rni. 

Go Back to previous p age 



GenCore version 6.2.1 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 

OM protein - nucleic search, using f rame_plus_p2n model 

Run on: November 29, 2007, 08:49:46 ; Search time 836 Seconds 

(without alignments) 
120.985 Million cell updates/sec 

Title : US- 10-5 68 -337-2 

Perfect score: 124 

Sequence: 1 MLRVLHRAASALVMATVIGLAPAVAFA 27 

Scoring table: BLOSUM62 

Xgapop 10.0 , Xgapext 0.5 

Ygapop 10.0 , Ygapext 0.5 

Fgapop 6.0 , Fgapext 7.0 

Delop 6.0 , Delext 7.0 

Searched: 5155175 seqs, 1873024446 residues 

Total number of hits satisfying chosen parameters: 10310228 

Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 

Listing first 1000 summaries 

Command line parameters: 

-THR=4 -MODEL=frame+_p2n. model -DEV=SOFT 

-Q-/abss /ABSSWEB_spool/US 10568337 /runat_291 12 007_08 4 935_2 67 15 /app_query. fasta_l 
-DB=Issued_Patents_NA -QFMT=fastap -SUFFIX-p2n . rni -SIMRANGE-0 . 0005 
-MINMATCH=0 . 1 -LOOPCL=0 -LOOPEXT=0 -UNITS=bits -START=1 -END=-1 
-MATRIX=blosum62 -TRANS=human4 0 . cdi -LIST=1000 -DOCALIGN=200 -THR_SCORE=pct 
-THR_MAX=100 -THR_MIN=0 -ALIGN=5 0 -MODE=LOCAL -OUTFMT=pto -NORM=ext 
-HEAPSIZE=0 -MINLEN=0 -MAXLEN=2 000000000 -MINDBSEQLEN=1 -HOST=ail-01 
-USER=US10568337_@CGN_l_l_134_@runat__2 9112007_08 4 935__2 6715 -NCPU=6 -NO^MMAP 
-NEG__SCORES=0 -WAIT -LONGLOG -MINDIST=10 -XGAPOP=10 -XGAPEXT-0 . 5 -FGAPOP=6 
-FGAPEXT=7 -YGAPOP=10 -YGAPEXT=0.5 -DELOP=6 -DELEXT=7 
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Database : Issued_Patents_NA : * 

1: /EMC_Celerra_SIDS2/ptodata/l/ina/l_COMB. seq: * 

2 : /EMC_Celerra_SIDS2/ptodata/l/ina/5_COMB. seq: * 

3: /EMC_Celerra_SIDS2/ptodata/l/ina/6A_COMB. seq: * 

4 : /EMC_Celerra_SIDS2/ptodata/l/ina/6B_COMB.seq: * 

5 : /EMC_Celerra_SIDS2/ptodata/l/ina/7A_COMB. seq: * 

6: /EMC_Celerra__SIDS2/ptodata/l/ina/7B_COMB.seq: * 

7 : /EMC_Celerra_SIDS2/ptodata/l/ina/H_COMB. seq: * 

8: /EMC_Celerra_SIDS2/ptodata/l/ina/PCTUS_COMB. seq: * 

9: /EMC_Celerra_SIDS2/ptodata/l/ina/PP_COMB.seq: * 

10: /EMC_Celerra_SIDS2/ptodata/l/ina/RE_COMB. seq: * 

11 : /EMC_Celerra_SIDS2/ptodata/l/ina/backf ilesl . seq: * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 



Result 
No. 


Score 


Query 
Match 


Length 


DB 


ID 










Description 
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Sequence 
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4 
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11 


Sequence 


11, Appl 


c 
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54 


43 


5 
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5 
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10 


-703 


-032- 


95089 


Sequence 


95089, A 


c 


6 


54 


43 


5 
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5 


US- 


10 


-703 


-032- 


91504 


Sequence 


91504, A 


c 


7 


54 


43 


5 


588 


5 


us- 


10 


-703 


-032- 


94706 


Sequence 


94706, A 


c 


8 


53 


42 


7 


419 


5 


us- 


10 


-703 


-032- 


49153 


Sequence 


49153, A 


c 


9 


52.5 


42 


3 


378 


5 


us- 


10 


-703 


-032- 


89469 


Sequence 


89469, A 


c 


10 


52 .5 


42 


3 


396 


5 


us- 


10 


-703 


-032- 


98346 


Sequence 


98346, A 


c 


11 


52.5 


42 


3 


418 


5 


us- 


10 


-703 


-032- 


98096 


Sequence 


98096, A 


c 


12 


52 . 5 


42 


3 


422 


5 


us- 


10 


-703 


-032- 


96699 


Sequence 


96699, A 


c 


13 


52 . 5 


42 


3 


425 


5 
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10 


-703 


-032- 


97933 


Sequence 


97933, A 


c 


14 


52.5 


42 


3 


428 


5 
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10 
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80385 


Sequence 


80385, A 


c 


15 


52.5 


42 


3 


433 


5 
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10 
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80412 


Sequence 


80412, A 


c 


16 


52.5 


42 


3 


438 


5 


us- 


10 


-703 


-032- 


49995 


Sequence 


49995, A 


c 


17 


52.5 


42 


3 


473 


5 


us- 


10 


-703 


-032- 


92026 


Sequence 


92026, A 


c 


18 


52.5 


42 


3 
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5 


us- 


10 


-703 


-032- 


55226 


Sequence 


55226, A 
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< ! --StartFragment--> ALIGNMENTS 
RESULT 1 

US-07-731-157A-1 

Sequence 1, Application US/07731157A 
Patent No. 5457032 
GENERAL INFORMATION: 

APPLICANT: Quax, Wilhelmus J. 
APPLICANT: Misset, Onno 
APPLICANT: Van der Laan, Jan M. 
APPLICANT: Lenting, Herman B.M. 

TITLE OF INVENTION: Mutated beta-lactam acylase genes 
NUMBER OF SEQUENCES: 50 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: COOLEY GODWARD CASTRO HUDDLESON & TATUM 
STREET: FIVE PALO ALTO SQUARE, 4TH FLOOR 
CITY: PALO ALTO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP : 94306 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC-DOS/MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/07/731, 157A 
FILING DATE: 19910509 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: EP 90200962 
FILING DATE: 18-APR-1990 
ATTORNEY/AGENT INFORMATION: 

NAME: RAE-VENTER PH.D., BARBARA 
REGISTRATION NUMBER: 32,750 
REFERENCE/DOCKET NUMBER: GBRO-02 7 /00US 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-494-7 622 
TELEFAX: 415-857-0663 
TELEX: 380816 COOLEY PA 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 2163 base pairs 
TYPE: NUCLEIC ACID 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
HYPOTHETICAL: NO 
ANTI-SENSE: NO 
ORIGINAL SOURCE: 

ORGANISM: Pseudomonas species 
STRAIN: SY77 
FEATURE: 

NAME/KEY: CDS 
LOCATION: 1..2163 

IDENTIFICATION METHOD: experimental 
OTHER INFORMATION: /codon_s tar t= 1 

OTHER INFORMATION: /product= "Glutaryl-Cef acylase" 
OTHER INFORMATION: /evidence^ EXPERIMENTAL 
FEATURE: 

NAME/KEY : sig_peptide 
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LOCATION : 1..87 
FEATURE : 

NAME/KEY : mat_peptide 
LOCATION: 88. .594 

OTHER INFORMATION: /product= "glutaryl-cef acylase" 
OTHER INFORMATION: /label= alfa-subunit 
FEATURE: 

NAME/KEY : mat_peptide 
LOCATION: 595. .2163 

OTHER INFORMATION: /product^ "glutaryl-cef acylase" 
OTHER INFORMATION: /label= beta-subunit 
US-07-731-157A-1 

Alignment Scores: 

Pred. No.: 3.68e-ll Length: 2163 

Score: 124.00 Matches: 27 

Percent Similarity: 100.0% Conservative: 0 

Best Local Similarity: 100.0% Mismatches: 0 

Query Match: 100.0% Indels: 0 

DB: 2 Gaps: 0 

US-10-568-337-2 (1-27) x US-07-731-157A- 1 (1-2163) 

Qy 1 MetLeuArgValLeuHisArgAlaAlaSerAlaLeuValMetAlaThrVallleGlyLeu 20 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGCTGAGAGTTCTGCACCGGGCGGCGTCCGCCTTGGTTATGGCGACTGTGATCGGCCTT 60 

Qy 21 AlaProAlaValAlaPheAla 27 

I II I I I I I I I I I I I I I I I I I I 
Db 61 GCGCCCGCCGTCGCCTTTGCG 81 

RESULT 2 
US-08-541-780-1 

Sequence 1, Application US/08541780 
Patent No. 5935831 
GENERAL INFORMATION: 

APPLICANT: Quax, Wilhelmus J. 
APPLICANT: Misset, Onno 
APPLICANT: Van der Laan, Jan M. 
APPLICANT: Lenting, Herman B.M. 

TITLE OF INVENTION: Mutated beta-lactam acylase genes 
NUMBER OF SEQUENCES: 50 
CORRESPONDENCE ADDRESS: 

ADDRESSEE: COOLEY GODWARD CASTRO HUDDLESON & TATUM 
STREET: FIVE PALO ALTO SQUARE, 4TH FLOOR 
CITY': PALO ALTO 
STATE: CALIFORNIA 
COUNTRY: USA 
ZIP: 94306 
COMPUTER READABLE FORM: 

MEDIUM TYPE: Floppy disk 
COMPUTER: IBM PC compatible 
OPERATING SYSTEM: PC- DOS /MS-DOS 

SOFTWARE: Patentln Release #1.0, Version #1.25 (EPO) 
CURRENT APPLICATION DATA: 

APPLICATION NUMBER: US/ 08 /5 4 1 , 7 8 0 
FILING DATE: 
CLASSIFICATION: 435 
PRIOR APPLICATION DATA: 

APPLICATION NUMBER: US/ 07 /7 3 1 , 157 
FILING DATE: 
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APPLICATION NUMBER: EP 90200962 
FILING DATE: 18-APR-1990 
ATTORNEY /AGENT INFORMATION: 

NAME: RAE-VENTER PH.D., BARBARA 
REGISTRATION NUMBER: 32,750 
REFERENCE/DOCKET NUMBER: GBRO-027 /0OUS 
TELECOMMUNICATION INFORMATION: 
TELEPHONE: 415-4 94-7 622 
TELEFAX: 415-857-0663 
TELEX: 380816 COOLEY PA 
INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 
LENGTH: 2163 base pairs 
TYPE: nucleic acid 
STRANDEDNESS : double 
TOPOLOGY: linear 
MOLECULE TYPE: DNA (genomic) 
HYPOTHETICAL: NO 
ANTI-SENSE: NO 
ORIGINAL SOURCE: 

ORGANISM: Pseudomonas species 
STRAIN: SY77 
FEATURE: 

NAME /KEY : CDS 
LOCATION: 1..2163 

IDENTIFICATION METHOD: experimental 
OTHER INFORMATION: /codon_s tart= 1 

OTHER INFORMATION: /product= "Glutaryl-Cef acylase" 
OTHER INFORMATION: /evidence^ EXPERIMENTAL 
FEATURE: 

NAME/KEY : sig_peptide 
LOCATION: 1..87 
FEATURE: 

NAME/KEY : mat_peptide 
LOCATION: 88. .594 

OTHER INFORMATION: /product- "glutaryl-cef acylase" 
OTHER INFORMATION: /label= alfa-subunit 
FEATURE : 

NAME /KEY : mat_peptide 
LOCATION: 595. .2163 

OTHER INFORMATION: /product= "glutaryl-cef acylase" 
OTHER INFORMATION: /label- beta-subunit 
US-08-541-780-1 

Alignment Scores: 

Pred. No.: 3.68e-ll Length: 2163 

Score: 124.00 Matches: 27 

Percent Similarity: 100.0% Conservative: 0 

Best Local Similarity: 100.0% Mismatches: 0 

Query Match: 100.0% Indels: 0 

DB: 2 Gaps: 0 

US-10-568-337-2 (1-27) x US-08-541-780-1 (1-2163) 

Qy 1 MetLeuArgValLeuHisArgAlaAlaSerAlaLeuValMetAlaThrVallleGlyLeu 20 

I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
Db 1 ATGCTGAGAGTTCTGCACCGGGCGGCGTCCGCCTTGGTTATGGCGACTGTGATCGGCCTT 60 

Qy 21 AlaProAlaValAlaPheAla 27 

I I I I I I I I I I I ! I I I I I I I I I 
Db 61 GCGCCCGCCGTCGCCTTTGCG 81 
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< ! --EndFragment — > 
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SCORE Search Results Details for Application 
10568337 and Search Result 
20071128_153802._us-10-568-337-5.rng. 

Score Home Retrieve Application SCORE System SCORE Comments / 

Page List Overview FAQ Sug g estions 

This page gives you Search Results detail for the Application 10568337 and Search Result 
20071 128_153802_us-10-568-337-5.rng. 

Go Back to previous p age 



GenCore version 6.2.1 
Copyright (c) 1993 - 2007 Biocceleration Ltd. 



OM nucleic - nucleic search, using sw model 



Run on: 



November 29, 2007, 00:22:26 /Search time 646 Seconds 

(without alignments) 
1147.147 Million cell updates/sec 



Title : 

Perfect score: 
Sequence : 



US-10-568-337-5 
100 

1 atcctggttcgtacgcgccg ctgggtgagaggggaaatcc 100 



Scoring table: IDENTITY_NUC 

Gapop 10.0 , Gapext 1.0 



Searched: 



5620219 seqs, 3705283702 residues 



Total number of hits satisfying chosen parameters: 



11240438 



Minimum DB seq length: 0 

Maximum DB seq length: 2000000000 

Post-processing: Minimum Match 0% 

Maximum Match 100% 
Listing first 45 summaries 



Database : N_Geneseq_2007 01 : * 

1 : geneseqnl980s : * 

2 : geneseqnl990s : * 

3 : geneseqn2 000s : * 

4: - geneseqn2 OOlas : * 

5 : geneseqn2 001bs : * 

6: geneseqn2002as : * 

7 : geneseqn2002bs: * 

8 : geneseqn2 003as : * 

9 : geneseqn2 003bs : * 
10 : geneseqn2003cs : * 
11: geneseqn2003ds : * 
12 : geneseqn2004as : * 
13: geneseqn2004bs : * 
14 : gene seqn2 005s : * 
15 : gene seqn2 006s : * 
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16 : geneseqn2007s : * 

Pred. No. is the number of results predicted by chance to have a 
score greater than or equal to the score of the result being printed, 
and is derived by analysis of the total score distribution. 

SUMMARIES 

% 

Result Query 





No. 


Score 


Match 


Length 


DB 


ID 


Description 
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100 


X \J \J . 
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ADY34485 
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2 


100 


100 . 


n 


209 


14 


ADY34486 


Ady34 4 86 Pseudomon 
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1 00 
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1 00 
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n 
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ADY34499 


Ady344 99 Human int 




6 


100 
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14 
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7 


94.8 


Q4 


Q 

o 
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4 


AAH27732 


Aah27732 GL7ACA (a 




8 


94.8 


Q4 
y *i . 


Q 
O 
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5 


AAH27740 


Aah27740 G1-7ACA r 




9 


93.2 


Ql 
y «j . 




2482 


4 


AAI64747 


Aai64747 Pseudomon 


c 


10 


30. 6 


10 


D 


489 


12 


ADQ21106 


Adq21106 Human sof 


c 


11 


30. 6 


10 


U 


560 


6 


ABK35222 


Abk35222 Human cDN 


c 


12 


30. 6 


10 


u 


2222 


10 


ADL25751 


Adl25751 Human can 


c 


13 


30. 6 


10 


u 


2222 


10 


ADL25753 


Adl25753 Human can 


c 


14 


30.6 


10 


D 


4369 


12 


ADQ25015 


Adq25015 Human sof 


c 


15 


29.6 


9 9 




1032 


5 


AAH74579 


Aah74579 Nucleotid 




16 


29 


9 Q 
c y . 


n 


110000 


4 


AAI99682 19 


Continuation (20 o 




17 


29 


9 Q 
t. y . 




110000 


4 


AAI99683_19 


Continuation (20 o 


c 


18 


28.6 


98 


D 


46235 


15 


AEF73891 


Aef73891 Human cor 


c 


19 


28 . 6 


9 fl 
z. o . 


a 
u 


49999 


2 


AAZ23903 


Aaz23903 Human LOB 




20 


28.2 


28 . 


2 


706 


2 


ADR02076 


Adr02076 A. gossyp 




21 


28.2 


9fl 


9 
z. 


1329 


8 


ACA19622 


Acal9622 Prokaryot 


c 


22 


27.6 


9 7 


a 
\j 


346 


3 


AAA31766 


Aaa31766 Plant mic 


c 


23 


27.6 


9 7 


a 

u 


438 


3 


AAA31518 


Aaa31518 Plant mic 




24 


27.4 


27 . 


4 


193303 


12 


ADF13122 


Adfl3122 Hypermeth 




25 


27.4 


9 7 


4 


193303 


12 


ADF13115 


Adfl3115 Hypermeth 




26 


27.4 


21 . 


4 


193303 


12 


ADI37268 


Adi372 68 Hypermeth 




27 


27.4 


9 7 


4 


193303 


12 


ADI37261 


Adi372 61 Hypermeth 


c 


28 


27.2 
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Z / . 


z 


5286 


4 


ABL05391 


Abl05391 Drosophil 


c 


29 


27.2 


97 

Z / . 


o 
z 


8303 


4 


ABL05390 


Abl05390 Drosophil 




30 


27 


Z / . 


U 


1260 


11 


ACL26209 


Acl26209 Rice abio 


c 


31 


27 


27. 


0 


1387 


13 


ADX62879 


Adx62879 Plant ful 




32 


26.8 


26. 


8 


825 


14 


ACL70964 


Acl70964 M. xanthu 


c 


33 


26.8 


26. 


8 


2955 


13 


ADO84038 


Ado84038 Plant ful 




34 


26.8 


26. 


8 


10916 


14 


ACL64594 


Acl64594 M. xanthu 


c 


35 


26.8 


26. 


8 


33126 


12 


ADQ97546 


Adq97546 Mouse can 


c 


36 


26.6 


26. 


6 


441 


11 


ABD05702 


Abd05702 Pseudomon 


c 


37 


26.6 


26. 


6 


1317 


11 


ABD05624 


Abd05 624 Pseudomon 




38 


26.6 


26. 


6 


1398 


11 


ABD06048 


Abd06048 Pseudomon 


c 


39 


26.6 


26. 


6 


3406 


2 


AAX20567 


Aax20567 Polynucle 




40 


26.4 


26. 


4 


494 


14 


ACL55838 


Acl55838 Human col 




41 


26.4 


26. 


4 


500 


6 


ABL65055 


Abl65055 Lung cane 


c 


42 


26.4 


26. 


4 


1441 


11 


AEA90319 


Aea90319 Human NOV 


c 


43 


26.4 


26. 


4 


2331 


6 


ABK34734 


Abk34734 Human cDN 




44 


26.4 


26. 


4 


3707 


2 


AAQ82792 


Aaq82792 TRK1 gene 




45 


26.4 


26. 


4 


3708 


5 


AAH21448 


Aah21448 S. cerevi 
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ADX70106; 

05-MAY-2005 (first entry) 

Recombinant protein preparation method-related gaclss gene region SeqID8. 



<!--StartFragment-->RESULT 4 
ADX70106 

ID ADX70106 standard; DNA; 315 BP. 
XX 
AC 
XX 
DT 
XX 
DE 
XX 
KW 
KW 
XX 
OS 
XX 
PN 
XX 
PD 
XX 
PF 
XX 
PR 
XX 
PA 
XX 
PI 
XX 
DR 
XX 
PT 
PT 
PT 
XX 
PS 
XX 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
CC 
XX 
SQ 



recombinant protein; protein production; 
ds. 



fermentation; expression; 
interferon alpha 2; gene; 

Brevundimonas diminuta. 

WO2005017174-A2. 



24-FEB-2005. 

12- AUG-2004; 2 004WO-EP009055 . 

13- AUG-2003; 2003US-04 94 915P . 
(SANO ) SANDOZ AG. 
Stempfer G, Alliger P, Palma N; 
WPI; 2005-182386/19. 

Preparing recombinant polypeptides of interest, for producing large 
variety of polypeptides of interest, by fermenting prokaryotic host cell 
comprising a periplasm transformed with a recombinant expression system. 

Example 1; SEQ ID NO 8; 29pp; English. 

This invention relates to a novel method of preparing a recombinant 
polypeptide of interest which comprises fermenting a prokaryotic host 
cell comprising a periplasm transformed with a recombinant expression 
system capable of bringing secretion of a polypeptide of interest ■ into 
the periplasm of the host cell and extracting the polypeptide of interest 
from the periplasm. The method is useful for preparing a wide variety of 
recombinant polypeptides of interest such as human interferon alpha 2. 
The present sequence is that of a region of the B diminuta gaclss gene 
which was used in the exemplification of the invention. 

Sequence 315 BP; 55 A; 86 C; 113 G; 61 T; 0 U; 0 Other; 



Query Match 100.0%; Score 100; DB 14 

Best Local Similarity 100.0%; Pred. No. 6.9e-23 
Matches 100; Conservative 0; Mismatches 0 



Length 315; 

Indels 0; Gaps 0; 



Qy 



Db 



Qy 



Db 



1 ATCCTGGTTCGTACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCG 60 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
116 ATCCTGGTTCGTACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCG 17 5 

61 CTGCAACGGCGTCTCCGGATCTGGGTGAGAGGGGAAATCC 100 
I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I I 
176 CTGCAACGGCGTCTCCGGATCTGGGTGAGAGGGGAAATCC 215 



RESULT 5 
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ADY34499 

ID ADY34499 standard; cDNA; 807 BP. 
XX 

AC ADY34499; 
XX 

DT 05-MAY-2005 (first entry) 
XX 

DE Human interf eron-alpha 2B expression construct. 
XX 

KW Recombinant protein; interf eron-alpha; mutant; gac gene; signal peptide; 

KW gene; ss. 

XX 

OS Homo sapiens. 

OS Brevundimonas diminuta. 

OS Chimeric. 

OS Synthetic. 

XX 

FH Key Location/Qualifiers 

FT CDS 210. .788 

FT /*tag= b 

FT sig_peptide 210. .290 

FT ~ /*tag= a 

FT /note= "Gac gene signal sequence" 

FT mat_peptide 291. .785 

FT /*tag= c 

FT /product= "Human mature inter f eron-alpha 2B" 
XX 

PN WO2005017162-A2 . 
XX 

PD 24-FEB-2005. 
XX 

PF 12-AUG-2004; 2004WO-EP009067 . 
XX 

PR 13-AUG-2003; 2003US-0494 914 P . 
XX 

PA (SANO ) SANDOZ AG. 
XX 

PI Windisch J, Schoergendorf er K, Palma N, Knauseder F, Boehling H; 
XX 

DR WPI; 2005-182378/19. 

DR P-PSDB; ADY34498. 
XX 

PT New expression vector comprising a polynucleotide encoding a fusion 

PT protein comprising the signal sequence of the gac gene of Pseudomonas 

PT diminuta and a polypeptide other than gac, useful for producing 

PT polypeptides. 
XX 

PS Example 1; SEQ ID NO 19; 39pp; English. 
XX 

CC The invention provides a process for the efficient and direct production 

CC of a mature recombinant polypeptide in a prokaryotic host cell. A claimed 

CC expression vector comprises a polynucleotide encoding a fusion protein 

CC consisting of the signal sequence ADY34482 of the glutaryl 7- 

CC aminocephalosporic acid acylase (gac) gene of Pseudomonas diminuta and 

CC the polypeptide of interest. A prokaryotic host cell transformed with the 

CC vector is cultured under conditions which cause expression of the 

CC polynucleotide. Upon formation of the fusion protein, the signal sequence 

CC is cleaved off and the polypeptide of interest is released into the 

CC periplasm of the host cell. The expression vector is a plasmid, 

CC preferably a high copy plasmid. The vector further comprises a 

CC polynucleotide comprising the promoter region and the ribosomal binding 
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CC site ADY34485 or ADY34486 of the gac gene of P. diminuta. The culturing 

CC is performed as a multi-stage fermentation process comprising a shake- 

CC flask step, optionally a pre-culture step, and a main culture step. The 

CC main culture step is performed in a culture medium comprising a substrate 

CC for more than 90% of the cultivation time at a substrate concentration 

CC lower that the saturation constant of the substrate, accompanied by high 

CC levels of dissolved oxygen concentration, and further accompanied by a 

CC steadily decreasing specific growth rate of the bacterial host cells, the 

CC process being performed at a temperature that is lower than the optimum 

CC temperature for growth of the host cell. The substrate is glycerol or a 

CC carbohydrate, preferably glucose. The process is favorably used for the 

CC production of recombinant human interf eron-alpha 2B in Escherichia coli. 

CC The present sequence is that of an expression construct for recombinant 

CC production of human mature interf eron-alpha 2B in E. coli. It comprises 

CC the P. diminuta signal sequence, promoter and ribosome binding site, and 

CC the coding sequence ADY34481 for interf eron-alpha 2B in which codons have 

CC been modified to improve expression. The expression construct was 

CC obtained by a combination of chemical synthesis and PCR amplification. 
XX 

SQ Sequence 807 BP; 177 A; 215 C; 227 G; 188 T; 0 U; 0 Other; 

Query Match 100.0%; Score 100; DB 14; Length 807; 

Best Local Similarity 100.0%; Pred. No. 8.1e-23; 

Matches 100; Conservative 0; Mismatches 0; Indels 0; Gaps 0;" 

Qy 1 ATCCTGGTTCGTACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCG 60 

I I ! I I I I I I I I I I I I I I I I I I I I I I I I I I I I I i I I I I I I I I I I I I I I I i I I I I I I I I I I I 
Db 110 ATCCTGGTTCGTACGCGCCGCCTACAAGTGGTGATCTAGGGGAACGTTCCGGGGGCGTCG 169 

Qy 61 CTGCAACGGCGTCTCCGGATCTGGGTGAGAGGGGAAATCC 100 

I I I I i i I I I ! I 1 i I I I I I I I I I I I I I I I I I I I ! I ! I E I I I 
Db 170 CTGCAACGGCGTCTCCGGATCTGGGTGAGAGGGGAAATCC 2 09 

< ! — EndFragment--> 
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